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ABSTRACT 



1 INTRODUCTION 



Since 1572, when Tycho's nova was first observed, 
the astronomical community has come a long way in our 
search and understanding of stellar cycles and processes. 
Still at magnitude 12, it is estimated th at at least 90 
per cent of the variables are unknown (jEver fc BlakeJ 
2005). Since Hipparcos began the multi-epoch photometric 
survey, there have been many grou nd based project s 
like the All-Sky Automated Survey [IPoimariski] [2 000). 
Hung arian Automated Telescope (HATnet; Bakos et al.J 
2002J) , Wide Angle Se arch for Planets (WASP/SuperWASP; 



Pollacco et al. , 2006) as well as the space missi o ns lik e 



COROT (jBaglin. Vauclair fc The COROT Team] |2000|) , 
Micro- variability and Oscillation s of STars (MOST ; 
iMatthews et al.J l200Ch . Kepler (iBlomme et al.J |2010( ). 
Solar Mass Ejection Imager ( SMEI ; ISprecklevJl200sl) which 
are dedicated to different types of variability studies and 
many of them have been hugely successful in not only 
discovering new exopl anets but also unrav e lling the stellar 
variability mysteries IpVIichel et al.J 120051 ; iPribulla et al.J 
l2010l ; lBasri et al..j|2010l . etc.) 

In this paper, we present the use of twin Solar 
TErrestrial RElations Observatory (STEREO) for the 



The Solar TErrestrial RElations Observatory - STEREO, is a system of two 
identical spacecraft in Heliocentric Earth orbit. We use the two Heliospheric Imagers 
(HI), which are wide angle imagers with multi-baffle systems to do high precision 
stellar photometry in order to search for exoplanetary transits and understand stellar 
variables. The large cadence (40 min for HI-1 and 2 hrs for HI-2), high precision, wide 
magnitude range (R mag - 4 to 12) and broad sky coverage (nearly 20 percent just 
for HI-1 A and 60 per cent of the sky in the zodiacal region for all the instruments 
combined) of this instrument marks this in a space left largely devoid by other current 
projects. In this paper, we describe the semi-automated pipeline devised for the 
reduction of this data, some of the interesting characteristics of the data obtained, 
data analysis methods used along with some early results. 
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study of stellar variability and Exoplanetary transits. The 
STRESS project occupies a very interesting position in 
this scenario in terms of the location of its field of view, 
size of the field and magnitude coverage. The spacecraft 
covers a portion of the sky classically avoided for these type 
of studies, i.e., in the zodiacal path. As to the planetary 
transit searches, all of the dedicated space-based projects 
are concentrating away from this region. Hence this leaves 
a gap in the science for this particular sky region, which 
we are trying to fill in this project. Also, the pass band of 
this instrument lies very close to the R magnitude of the 
stellar intensity spectrum. Hence we find that many of our 
tracked objects have actually no literature or very little 
basic information. 



In Section 2, we describe the spacecraft and the 
instrumental characteristics, in Section 3, the data reduc- 
tion pipeline and then the general characteristics of the 
data in Section 4 and some early result in Section 5 and 
summarising in Section 6. 



2 STEREO SPACECRAFT 



* E-mail: vs@star.sr.bham.ac.uk 



STEREO is a set of two identical spacecrafts launched 
on October 25, 2006 to study the characteristics and 
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Table 1. HI Instrument Characteristics 



instruments. 



HI-1 HI-2 



Field of View 


20° X 20° 


70° x 70° 


Plate scale 


35. 15" /pixel 


2.05'/pixel 


Passband 


6300A - 7300A 


4000A - 10000A 


Nominal exposure time 


40 sec 


50 sec 


Integrated cadence 


40 min 


2 hrs 



evolution of Coronal Mass Ejections (CME) in stereoscopic 
projection. The two spacecrafts - one ahead of Earth (A) 
and one trailing behind Earth (B) - are in an Heliocentric 
orbit each separating from Earth at a rate of 22° per 
year. A pair of Heliospheric Imagers (HI) in the SECCHI 
suite of instruments for each spacecraft is of particular 
interest to do stellar photometry due to its near continuous 
observation of the interplaneta ry region from 12 to 215 
solar radii (|Howard et al.j|2008h . 

The HI-1 and HI-2 cameras have individual exposure 
times (equivalent to single shutter open/close cycle) of 
40 and 50 seconds. Thirty such exposures are summed 
up together on board the spacecraft, after correcting for 
cosmic ray effects, thus providing a cadence of 40 minutes 
for each HI-1 image transmitted to the ground. Similarly 
99 exposures of HI-2 are summed to provid e a digitally 
integ rated image cadence of 120 minutes (see lEvles et al.J 
120091 for detailed explanation). The actual 2048 x 2048 
images are also 2x2 binned, resulting in 1024 x 1024 images 
for all the instruments, thus giving a final resolution of 
70.30" /pixel and 4.10'/pixel for HI-1 and HI-2 respectively. 



3 DATA REDUCTION 

In this paper we describe the general analysis 
pipeline developed for all the HI instrument and 
specifically its application to the HI-1 A instrument 
and the results derived thereafter. HI-1 A instru- 
ment is preferred over the other HI instruments for 
this basic analysis due to its well denned PSF and 
also because it is devoid of many artifacts present 
in the other HI instruments. At a later stage, we 
plan to analyse and integrate the data from other 
HI instruments in order to increase the quality and 
quantity of light curves obtained. Here we describe 
the semi-automated photometry pipeline (see Fig. [5] for 
a schematic) developed with the help of the Rutherford 
Appleton Laboratory (RAL) using IDL solar soft packages 
using data for a period of 37 months from April 2007 
till March 2010. Data from all the STEREO spacecraft 
instruments are publically available for download from 
the UK Solar System Data Centre (UKSSDC) websiteQ in 
an interactive way allowing the user to download images 
for various periods of interest from the different SECCHI 

1 www.ukssdc.rl.ac.uk 



3.1 Pre-processing of data 

There are three different levels of processed data 
available - Raw summed images (L0), which are the on 
board summed images produced after removing the cosmic 
ray hits. The level one images (LI ) are pre-processed 
using a pipeline developed by RAL (Br own et al.J 120091 ') 
to remove the various instrumental and general spacecraft 
effects explained below: 

Flat-field - A flat field map, produced before the 
launch of the spacecraft during the ground calibration of 
instruments, is applied to the images to compensate for the 
differential gain across the CCD pixels. 

Shutterless Readout - The HI cameras do not have a 
shutter, so that many short exposure images can be taken. 
A shutter less camera induces different exposure times to 
different parts of the image and smears it vertically. A well 
understood mathematical correction of the form 
given below is applied to the ima ge to de couple this 
effect from the images (See lEvles et al.J [20091 for a 
detailed explanation). The total response R for a 
pixel (n, m) is given by 

R(n,m) = [T exp x I(n,m)] + S( H=m+ i, 2 047) [Tdear 

X I{n,y)] +S(3, = o, m _l) [Tread X I{ n ,y)] 

Where the count rate for the pixel (x, y) is given 
by I(x,y), T exp is the exposure time, T c i ear and 
T rea d are the line transfer time during the previ- 
ous exposure clearing and current exposure reading. 

Missing Data - Blocks of data which were lost due 
to telemetry drop-out are replaced by NaN values in the 
images. 

Bad Files - Certain images are tagged as bad due to 
various reasons like corrupted data, telemetry dropouts, 
split star field, integer overflow, etc. These are identified 
using a bad file list provided by the UKSSDC and removed 
from further analysis. 

Blooming - For magnitude brighter than 4 (m < 4), 
HI-1 pixels saturate and HI-2 pixels saturate for magnitudes 
brighter than 1 (m < 1). Such columns are identified and 
replaced with NaN values. 

Updated Pointing Information and Optics Parameters 
- Raw data contains only nominal pointing information and 
camera optics as calculated from pre-launch parameters. In 
the initial stages of analysis, the instrument pointing values 
were found to have larger offsets than those predicted. 
Hence an updated spacecraft pointing, along with optical 
parameters, is calculated using star fields in the image and 
the header is updated with the new information during this 
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Figure 1. Left - Sample image with the solar F-corona present. The line running across the image is the blooming effect caused by a 
planet which can be seen prominently in the image on the right. Right - The same image after the subtraction of a running window map 
of one day which removes most of the contribution from the F-corona, thus enabling us to do stellar photometry. We can also identify 
the planet as well the solar streamers in this image, which were impossible in the image on the left. The image in the right is the data 
image that we use in further analysis. We can also understand from this figure the influence of the solar phenomenon in the photometric 
reduction and analysis. 



stage. 

Even after this process, we found that certain frames 
cannot be used for further reduction due to various reasons. 
The most common cause among them was found to be due 
to certain spikes and black outs in the data which are not 
tagged by the previous routines and these bad frames were 
removed automatically using a five sigma clipping. 



3.2 F-corona subtraction 

The data at this stage (LI) still has the dominant solar 
F-corona, which is the scattered sunlight from the dust 
particles in the inner Heliosphere. This light extends till 
Earth's orbit and merges with the zodiacal light at night. 
The brightness of this corona varies from 1CP 12 to 10~ 6 
times the brightness of the solar disk. Fig. [1] illustrates the 
need for the removal of the F-corona in order to do stellar 
photometry. The dark line running through the column is 
due to blooming of objects brighter than m = 3, which 
can be seen prominently in the figure on the right. Fig. [2] 
contains a plot of the image through the middle row with 
the F-corona present and when it is removed. It is clear 
from this plot the dominance of F-corona in the data. The 
stars which are almost invisible in the top plot becomes 
clearly visible in the bottom plot. The brightest of the 
objects at around 400 and 600 pixels are 6th magnitude 
stars and the fainter ones which are barely visible in the 
bottom plot are the 11th and 12th magnitude stars. During 
the initial stages of the project, a minimum background 
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Figure 2. This is a slice of the image through the centre before 
and after the F-corona subtraction. The spikes are the bright stars 
in the field which are barely visible in the top image and we can 
identify even the fainter stars when the F-corona is removed. 

map is obtained by computing the minimum value of each 
pixel for a set of images. After experimenting with different 
number of images and studying the residual F-corona 
and its variation present in the data, we found a 
day worth of images, i.e., 36 images, to be a best 
fit. This map is then subtracted from each of the images 
which removes most of the F-corona. 

The light curves computed after this method of F- 
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coronal removal has a dominant signal of around one day 
corresponding to the minimum map used for subtraction 
and this made studying periods around one day impossible. 
Recently, RAlfl formulated a new method of F-corona 
subtraction using a running window either of one day or 
eleven days and within that window select the lowest 25 
per cent of the data and form an average of this data on 
a pixel by pixel basis, which is then subtracted from the 
images. Two different window lengths are used in order to 
study different types of solar phenomenon. In the case of 
stellar photometry, the one day window seems to improve 
the photometry of the data much better than the eleven 
day window. To further understand the effect of this new 
background subtraction in our data, we computed a pa- 
rameter, the Fill factor (F), which essentially gives the 
percentage of data points available for an individual 
object or how well the star is tracked through a 
sequence of images. This is computed by dividing 
the number of data points present in the final light 
curve for the star by the total number of frames 
present from the moment when the star enters the 
field of view to when it leaves. In Fig. O we plot this 
fill factor, F against the standard deviation, a of the light 
curves. 

The data used in this plot is from around 7700 stars 
which are tracked when the instrument faces the galactic 
centre and hence involves a very crowded field, in order to 
consider an extreme case. The a are computed from raw 
light curves which contain the long term trends mainly 
caused by the variation of F-corona in time scales of the 
data. A fill factor of one indicates that the star is 
completely tracked through all the images and F < 1 
indicates a loss of data during its presence in the 
field of view. We find that in the running window 
method, there are many stars which are completely 
tracked whereas in the old background subtraction 
method none of the stars are tracked completely. 
Also from the data, we can infer that the 'raw' scatter 
of the data is close to 10 per cent photometric precision. 
There are a few objects below that scatter, close to the zero 
value of the fill factor, which are the result of very few data 
points available for computation and are not indicative of 
better photometry. But it has to be pointed out that this 
precision is obtained with all the systematic effects present 
in the data and hence there is a huge scope for improvement 
after treatment with various trend-removing methods. We 
did seem to reach a a of less than 1 percent when the 
systematics are removed as discussed in Section [4] 
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Figure 3. Plots of scatter vs fill factor for a sample data of 
around 7700 stars tracked for 10 days in the month of October 
2007. The top plot is using the one day minimum map as a back- 
ground and the bottom plot is using the new running window 
as the background for F-corona subtraction (see text for further 
explanation) . 



for nearly 1.1 billion stars derived from Hipparcos, Tycho-2, 
UCAC-2 and USNO-B1 catalogues for astrometric and 
optical data and 2MASS catalogue for infra-red photometric 
information. Som e of the catalogues are complete up to a V 
magnitude of 21 (|Monet et al.j|2003t ). 

We use the image header information to convert 
from the spacecraft to sky co-ordinates and thus identify 
targets using our input catalogue. The pixels are flagged 
as probable stellar sources, if the count is 1.5 times higher 
than the surrounding 4 by 4 region. When these sources 
are matched to the catalogue we use an error margin of 2 
pixel. The large pixels of the HI images induces PSF 
contamination between close targets of comparable 
magnitudes. We apply a basic magnitude-distance 
criterion to identify and remove a large portion of 
this effect. Thus, pair of objects which are closer 
than 10 pixels and have a magnitude difference of 
less than 0.5 are removed from the target list. We 
also remove the fainter star in these pairs within 
the same radius but with a magnitude difference 
of 2.5 or lesser. This is explained in Fig. [4l where 
in the left panel, there are two stars which are 
2 http://www.ukssdc.rl.ac.uk/solar/stereo/documentation/HI_processiigsfe2jdiaaihtli(il pixels away and are of comparative 



3.3 Stellar photometry 



is now extracted from 
data) using the NOMAD 



Photometric information 
these images (k n own as L2 

l|Zacharias et al.] I2004T I as input catalogue for stars 
brighter than R = 12 by our semi-automated 
pipeline which is explained below. NOMAE0 - Naval 
Observatory Merged Astrometric Dataset has information 
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Figure 4. STEREO HI-1A images of stars. Left panel: An ex- 
ample of two stars which are at a distance of less than 10 pixels 
contaminating each other and have a magnitude difference of 2.5 
or lesser. Right panel: An example of two stars of comparable 
magnitude which are well separated such that their PSF does not 
overlap and are well distinguished at a separation of 10 pixels. 

brightness that they contaminate each other and 
in the right panel, there are two stars which are 
clearly distinguishable when separated by more 
than 10 pixels. In the case of bright stars, this just 
resolves the two PSF. In order to detect further 
contamination, which is not removed by this filter, 
we use three different aperture sizes for calculating 
the flux. By using the biggest aperture for the 
brighter star and the smallest for the fainter star, 
we try to rule out further confusion. In spite of 
all these measures, we still find that targets are 
either missed out or mistaken for nearby closer and 
brighter ones. We discuss these issues in detail in 
Section 14.11 We would also like to note that in all 
these process, we are only dealing with the correct 
identification of targets and the issue of dilution 
of flux by nearby targets along with a noise analy- 
sis will be discussed in detail in a forthcoming paper. 

Generally, the PSF of HI-1A is well behaved in most 
regions of the CCD, although it deviates a little towards 
the edges of the frame. The PSF in the centre of the field 
is highly circular whereas it becomes elliptical at the edges. 
This effect is more prominent in other HI instruments. 
Hence, a more robust method of identifying the centroid 
of the targets is essential., Therefore, we use the method 
developed by Steve Spreckley for use with SM EI, where 
the PSF is of a very odd shape l|SprecklevJ l2008 )■ Aperture 
photometry is done on these selected targets with the 
inbuilt IDL routine for aperture radii 2.5, 3.0 and 3.5 
pixels and sky background is calculated between pixel 
radii 6.0 and 10. The CCD is assumed to have a quantum 
efficiency of 90 per cent for this spectral range and the 
photon-electron conve rsion gain for the camera is 15 units 
l|Bewsher et al.Jl201Ch . Sky variance, random photon noise 
and the uncertainty in the mean sky brightness contribute 
to the measured error values of the flux. The flux values 
mentioned in this paper are in units of DN/s and 
not converted to a magnitude scale. 

The time stamp in the final data is the time at the 
start of the observation in Geocentric Julian dates with 
correction for the sun-spacecraft distance applied. This light 
travel distance between the sun centre and the spacecraft 



is found from the image header keyword. We would like 
to note that this time of observation will vary across the 
CCD due to the large sky area covered but this difference 
would be very small compared to actual cadence and hence 
ignored. Also, due to the complicated integration of the 
data on board the spacecraft, the start of observation is 
found to be the best parametric representation of time. 
Fig. [5] gives a schematic of the analysis pipeline discussed 
so far. 

In the next section, some of the general characteristics 
of the data are explained in detail. 



4 DATA CHARACTERISTICS 

Complete data for each star, which comprises of the 
Right Ascension (RA) and Declination (Dec) marking the 
position, R magnitude obtained from the input catalogue, 
flux values computed from the pipeline for each aperture 
and its error values along with the sky background for each 
frame and its error values, is stored in an IDL data format. 
Typical time taken by an object to transit the HI-1 field 
of view is about 20 days (around 700 data points) and for 
HI-2 instrument it is about 70 days (around 2200 points). 
An integration of data from all instruments will give us 
around 180 days worth complete data set for each orbit of 
the spacecraft. Since the cadence of the HI-1 instrument 
is 40 minutes, the Nyquist frequency for this set of data 
is around 18 cycles per day (cd ) and for HI-2 it is 6 
cdT 1 . So far we have obtained data for nearly a million 
stars covering a period of 3.5 years. 

The data in general exhibits systematic trends which 
are the resultant of the basic instrumental design as well 
as trends which are common to all the stars in the region 
either due to some part of the processing pipeline or due 
to a systematic change in the basic conditions of the images. 

One of the most common phenomenon is the increase 
or decrease in the median intensity of the flux due to the 
presence of either a Solar-wind transient event or a CME 
in the image. Since the flux value is relative to both the 
actual pixel intensity as well as the different backgrounds 
which are being subtracted in various stages, the increase 
in intensity can be attributed to an increase in the flux of 
the whole region and decrease to a higher value of coronal 
background which is being subtracted and hence we find 
this effect being manifested in all the light curves in the 
region. In order to illustrate this effect, we have plotted the 
median intensity of all the light curves in a particular region 
for a period in which we know the camera encountered a 
Solar transient event Q in Fig. [6] The data used in this plot 
is from a very small strip-like region of CCD comprising 
of 10 pixels by 1024 pixels and we can clearly identify the 
time when the Solar wind cloud passed through this region 
imaged by the CCD from the high scatter of all the light 

4 T he event list is provided at the ST EREO site 
://www.sstd. rl.ac.uk/stereo/HIEventList. html 
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Figure 5. A schematic of the STEREO HI data analysis pipeline, showing the various stages of data preparation involved in producing 
light curves from raw images (see text for explanation). 
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Figure 6. A plot of the median value of all the light curves (12) 
against time in a small strip of 10 pixels by 1024, which encounters 
a Solar wind transient event. This clearly illustrates the effect of 
such events in all the objects in the region. 



curves in that portion. We calculate the median by first 
normalising the light curves with its median value before 
adding all the intensities and dividing by the total number 
of light curves. 

Fig. [6] also shows another common trend, which is the 
loss of flux at the end of the frame. Sometimes such a loss is 
also found at the start of the light curve. These effects are 
due to the variation of the PSF at the end of the CCD and 
are magnified in the other HI instruments. Another trend 
we commonly find is a systematic variation across the whole 
light curve for all the light curves. Since we can remove 
this trend by fitting a 4th order polynomial, which is the 
same as the CCD response function, we thi nk this effect is 
a manifestation of the response function l|Bewsher et al.J 



l2010h - A detailed discussion of this trend can be found in 
the next section. 

Another feature which we find in many light curves are 
sudden flux increase, similar to flares - a sharp and sudden 
increase for a few frames, mostly caused by the passage 
of the object either through a solar filament or through 
a bright object like a planet or a brighter star. These are 
highly transient and temperamental. 

All these above trends, though systematic, are 
highly time dependent. But there are also trends which 
are independent of the time of the observation but rather 
related to the magnitude of the object. This is the relative 
scatter of the flux against magnitude of the star. In 
an ideal case, when we remove all the systematic noise 
mentioned above, we can end up with the photon-shot 
noise limited flux values corresponding to the magnitude 
of the object under study and one of the advantages of 
a space based mission is that we can easily identify all 
the systematics and hence can achieve this limit much 
more easily than a ground based observation. But in this 
project, we are limited at a higher flux value due to the 
basic instrumental aim of studying solar phenomenon and 
not stellar variability analysis. Some of the limiting factors 
in this case are the presence of bright solar corona and 
planets, large image pixel, Galactic Plane crossing and last 
but not the least, a very small team dedicated to its analysis. 

Fig. [7] is a plot of the relative scatter (by relative 
scatter we mean the variance found after normalising the 
light curves) against R magnitude of stars tracked during 
September and October of 2008. The sample is represen- 
tative of general STRESS characteristics since the later 
part of the sample is through the highly crowded galactic 
plane region and the former is through a relatively sparsely 
distributed region. A one day boxcar average is smoothed 
from all the light-curves before calculating the scatter. 
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Figure 7. A plot of relative scatter of intensity against magnitude 
of the stars. 



This is to remove the residual effects of the F-coronal 
subtraction. Hence most of the variability in the data is 
either stellar or other systematics. Still, for most of the 
objects brighter than 8th magnitude, we have a relative scat- 
ter less than 1 per cent and for fainter objects a few percent. 



4.1 Post processing the Light curves 

From the above section, we find that there are many 
systematic variations present in the light curves. In this sec- 
tion, we present some of the methods attempted and finally 
used in removing or reducing these effects so as to analyse 
for periodicity in these data. Some of the well established 
methods in use for this type of analysis are the filtering 
methods like median smoothing and box-car average 
smoothing, polynomial fit ting and various new specialise d 
algorithms like Sys-Rem l|Tamuz. Mazeh fc Zucker] [2 005). 
Tren d Filtering Algorithm (TFA; Kovacs, Bakos & Noycs 
20051 . Dae Kim-won's algorithm l|Kim Dae- Won et al. 



20091 ). Even though these form the basis of many photomet- 
ric analysis, they have to be adapted specifically to our light 
curves due to the scanning nature of our instrument and 
the different background issues in the data. One of the main 
issues we have to deal with in our data, is the crowded field. 
Even after application of the confusion criteria mentioned 
in Section 13.31 we find that we are tracking two objects 
alternatively and this raises the scatter of the light curves. 
Usually this confusion occurs when we have two or more 
nearly similar brightness objects. Since we are applying the 
criteria for only pairs of objects, sometimes these objects 
are still tracked. 

Even though these light curves make up less than 1 per 
cent of the total, these 'alternating' light curves generally 
create a very periodic signal, since the tracking algorithm 
shifts the centroid very systematically depending on which 
star is in maximum at that point and hence often these light 
curves get flagged up as a periodic signal during the period 
analysis. Even a robust sigma clipping does not identify 
these objects since the actual variance of the light curve 
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Figure 8. Quality control in light curves: Above, we have plot- 
ted the cumulative frequency distribution of a light curve against 
actual intensity. The top plot is the distribution for a 'good' light 
curve, i.e, where only a single star is tracked throughout the CCD. 
The actual light curve is given as an insert in the plot. The bot- 
tom plot is the same distribution when two stars are tracked al- 
ternatively such as to give rise to a periodic signal. We can easily 
identify the 'bad' light curve from the inter-quartile range (IQR) 
which is given. The higher this value, the larger the spread of the 
light curve. This is a fairly robust method of identifying wrongly 
tracked objects in our data. 



itself is very high and thus is an inaccurate measurement of 
the scatter in the system. In order to flag these objects, we 
use the inter-quartile range measurements. Fig. [8] illustrates 
this method in detail. The top plot is the cumulative 
intensity against flux for a good light curve (see insert) and 
the bottom is the same for a highly scattered light curve, 
which is the tracking of two objects almost alternatively. 
Another method which was used initially as a cross check, 
is the difference in intensity for the various apertures. The 
fainter stars have the same flux value irrespective of the 
aperture size and the brighter stars just increase their flux 
values as we go to larger apertures since most of the star 
PSF is captured in larger apertures. But when there is 
a confusion between two stars, there is a random change 
in flux values irrespective of the increase in aperture size. 
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Figure 9. The top plot is the original light curve of V = 6.7 
star before applying any correction and the bottom plot is the 3 
hr smoothed function which is used as the light curve for further 
analysis. This ensures the removal of the high frequency noise in 
the data. This filter is applied only for stellar variability analysis 
and not for planetary transit analysis. 



Hence, they can be used to identify wrongly tracked objects. 

The light curves selected after using the quartile filter, 
has both a high frequency and a low frequency noise 
component in them. We first remove the high frequency 
noise by using a box-car filter corresponding to a box 
width of around 3 hrs. Since this is also the time scale of 
any planetary transiting event, we implement this filter 
only for stellar variability analysis. We use the resultant 
smoothed function for further analysis and the residual in 
the calculation of our signal-noise ratio when a stellar signal 
is confirmed. Fig. [9] illustrates how the use of smoothed 
function removes the high frequency noise in the data for a 
randomly selected standard star (HD 112281). This has a V 
mag of 6.70 and a fairly non-interesting star, which will be 
used in all further processing examples. The usage of this 
smoothed function as the light curve for further analysis 
does not significantly affect the standard deviation of the 
light curve. 

In order to remove the long term trends, two differ- 
ent methods were experimented with - a box-car average 
smoothing and polynomial fitting. After trying different box 
widths, a one day smooth was found as a best fit to many 
light curves. Removing a 4th order polynomial also tend 
to remove mos t of th e long term trend in the data. From 
iBewsher et "all (|2010h . we know that the CCD response 
varies exactly as a fourth order polynomial across the CCD 
central axis. And hence could be causing this systematic 
trend across all the light curves. One of the disadvantages of 
the polynomial fitting is that, when there is a larger stellar 
variability signal, this fitting becomes inaccurate and thus 

5 All the magnitude mentioned hereafter refer to information 
from the Simbad database for ease in cross checking. 
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Figure 10. The top plot illustrates how both a 4th order poly- 
nomial as well as a one day box-car average smoothing function 
fits to the light curve. The bottom two plots give the residuals 
after removing the polynomial and the box-car filter respectively. 



increases the variance of the final light curve. Even though, 
the polynomial is a theoretical fit to the trend, when the 
pipeline is automated, box-car smoothing is more reliable 
so as to remove only the systematics. Fig. \W\ shows how 
a box-car average of one day and a 4th order polynomial 
fits the long term trend in this sample light curve. We also 
find that the polynomial fit does not remove all the long 
term trend but only that specified by the CCD function 
whereas the smooth function removes all types of long term 
variation. This is certainly helpful when looking for periods 
relative to the length of the light curve. 

After removing the various known noise sources in 
the light curve, we analyse t hem for varia bility using 



Lomb-Scargle periodogram (LS; Scargle. 1982) and Phase 
Dispersion Minimisation fPDM: IstellingwerfJ 1 19781) . We 



use LS to flag up potential variables and then confirm the 
periodicity with PDM. Our initial analysis found that the 
LS is very sensitive to larger periods while PDM is sensitive 
to the shorter periods and thus using both increased the 
probability of finding a true variable and eliminating 
false positives. We also used Box-fitting Le ast Squares 
method (BLS; iKovacs. Zucker fc Mazeh] I2002T ) to look for 
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44 Tau : Spect. Type - F2 : V Mag - 5.39 




MJD - 2454000. MJD - 2454000. 

Figure 11. The above is the STRESS light curve of the 44 Tau system. The plot on the left is an illustration of the data availability for 
more than three epochs as of now. This aids in more precise period determination as well as characterising period changes in an object. 
The right plot is a zoomed in version displaying clearly its symphony of frequencies. 



exoplanetary transits. All of these periodogram analysis 
will be described in detail in a forthcoming paper. 



5 RESULTS 

In this section, we present some of the sample light 
curves which are obtained by the above analysis pipeline. 
Fig. lTTIT^ andlTTjfl are a good but random representation of 
the whole data in terms of magnitude, different variabilities 
observed as well as quality of the light curve in terms of 
scatter. The left plot in Fig. II ll shows the whole data 
set for the three epochs, each made up of 20 days 
each and the right plot is a blow-up of a few days of 
data where the variability is clearly evident. Further 
individual study and analysis of the data will be presented 
in a sequel to this paper. 

HP 26322 : (= 44 Tau) A F2 star, 8 Scuti variable 
with 49 known pulsatio nal frequenc ie s ( V mag = 5.39) in 
15 independent modes (|Lenz et al.J l|2010ri and references 
therein). The evolutionary phase of this well studied object 
is still unclear and under investigation. STEREO has data 
for three epochs (Left plot in Fig. Illfl and we were able to 
identify at least 15 p ulsational frequencies using Period04 
l|Lenz fc Breger.ll2004h . 

HP 14147 : (= AD Ari ) A F 8 Sc uti variable of 
V mag of 7.43. E ven-though IZhouJ (2002), identifies two 
different periods, iRodrfguez et al.T (|2000h find only one 
period at 0.2699d. Our analysis using Period04 gives a value 
of 0.269d but a phase folded plot at this period, gives two 
distinct amplitudes. The top right plot of Fig. [12] is folded 
over twice the period found (0.54d) and from this plot, we 
suspect that this could be a binary system rather a 8 Scuti 



6 Normalised flux in these figures are the actual flux val- 
ues divided by the median values and added one. 



variable. 

HP 31679: (= V1061 Tau ) A recently discovered 
ijTerrell. Williams fc Kaiser J 1 19951 ) B5, eclipsing binary of 
the j3 Lyrae type with a V = 7.95. They found it to be a 
system with a mass ratio of 2.407 and also speculate on the 
probable presence of a third light in the light curve, which 
is not confirmed yet. STRESS light curve exhibits a period 
of 1.379d. 

HP 37821 : (= V1238 Tau) An AO eclipsing binary of 
the W Uma type of a V = 8.5. The All Sky Automated 
Survey Catalogue of Variable Stars (ACVS) defines a period 
of 1.12173d. 

HP 93077 : (= EX Leo) A F6 eclipsing binary star of 
the W Uma Type with a known pe riod of 0.40860d with 
a B mag of 8.71. H er et al.J (|2009l ). used this star as a 
sample in their re-calibration of the Period - Luminosity 
Colour relationship of the W Uma type Binaries. It was 
also studied by the N2K consortium using the KPNO low 
resolution spectra in or der to identify targets for the Keck 
Planet search program (|Robinson et al.j|2007l ). 

Below, we present a sample of new variables 
(Fig. I13[) discovered in our data using our variability 
analysis pipeline. This analysis algorithm and complete 
catalogue will be discussed in a forthcoming paper in this 
series. Hence we would like to stress the fact that these 
results are presented to just give a flavour to the reader and 
needs further investigation. 

HP 20756 : (= 61 Tau) A B5 star of V mag 5.3 
in a multiple system as per Simb ad's classification and 
catalo gued as an eclipsing binary bv lEggleton fc Tokovinin] 
(2008). From our light curve, we find two prominent periods 
at 1.105d and 1.492d. As you can see these indicate a 
controversy with the known literature and needs further 
investigation. 
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HD 37821 : Spect. Type - AO: V mag - 8.5 
1.2 




HD 93077 : Spect. Type - F5: B mag - 8.7 
1 .1 5 [ 



0.4 0.6 
Phase 




0.4 0.6 
Phase 



Figure 12. STRESS sample light curves : The above are plots of known variable stars of different types of variability folded on the 
period found from the STRESS light curve. Again we are stressing that this is only a sample of our data and even these data needs to 
be further analysed to realise the full potential of the science that can be derived from them. 
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Figure 13. HD 20756 exhibits two prominent period in the STRESS light curve. The left plot is a light curve for one epoch of our data 
and the right is the LS periodogram. The data exhibits two clear periods at 1.105d and 1.492d. 
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Figure 13. Two new variables with no previous literature on its variability. The left is a plot of HD 222891 with shallow eclipses of period 
1.5949d and also with out-of-eclipse pulsations. The right is a plot of BD+02 2686, a faint eclipsing binary with no known literature, 
having a period of 0.77d. 



HD 222891 : With a V = 8.07 and spectral type F8, 
this star was studied by the N2K consortium as a potential 
target for the Keck planet search program but was rejected 
based on its atmospheric parameters l|Robinson et al.J 
12007ft . We find well defined eclipses with period of 1.5949d, 
which are just out of bounds, i.e., too deep for a planetary 
companion. This light curve also illustrates that we can 
detect shallow eclipses of less than half percent easily in 
our data. The presence of additional out-of-eclipse pulsa- 
tions, nevertheless makes this an interesting system to solve. 



the sampling rate (18 cdT 1 and 6 cdT 1 for HIT and HI-2 
respectively), sky coverage (60 percent of sky when all 
instruments combined), magnitude range (brighter than 12) 
and also its potential to be combined with other STEREO 
HI instruments (180 days of data in total) and hence can be 
used for long term study as well. And also round-the-clock 
observation of the spacecraft's unique field of view in the 
ecliptic, gives STRESS the unique place in the current day 
exoplanet and stellar variability search programs. 



BD +02 2686 : Known as a G5 star of V = 9.75 
with no other literature. Our light curve clearly shows the 
presence of both primary and secondary eclipses with a 
primary period of 0.774d. 



6 SUMMARY 

In this paper, we have described the semi-automated 
stellar photometry pipeline developed for the reduction of 
STEREO HI images. Using this pipeline we have collected 
photometric information for nearly a million stars covering 
37 months from April 2007 till March 2010, with most stars 
having data for three epochs and some for four epochs. We 
have also outlined the general characteristics of the data 
along with some of the systematic effects found in the data 
and the different methods used to remove these systematic 
trends. Even though the cause of many of these effects are 
still not understand completely and needs further analysis, 
the relative scatter of most data brighter than R = 8, is 
less than one percent and for fainter objects a few percent 
at best. Finally, we present a sample of the interesting light 
curves in our data. The data which we have accumulated 
till date have a lot of potential for further study in terms of 
variability analysis of known, suspected and new, unknown 
objects as well as for exoplanetary analysis. We also plan to 
release this data as a photometric catalogue to the general 
public in the future, due to its unique characteristics, like 
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